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^ ' Abstract 

Galles and Pearl II1998I claimed that "for recursive models, the causal model framework does not 
add any restrictions to counterfactuals, beyond those imposed by Lewis's [possible-worlds] frame- 
work." This claim is examined carefully, with the goal of clarifying the exact relationship between 
causal models and Lewis's framework. Recursive models are shown to correspond precisely to a 
I subclass of (possible-world) counterfactual structures. On the other hand, a slight generalization of 

recursive models, models where all equations have unique solutions, is shown to be incomparable 
Q ] in expressive power to counterfactual structures, despite the fact that the Galles and Pearl arguments 

should apply to them as well. The problem with the Galles and Pearl argument is identified: an ax- 
iom that they viewed as irrelevant, because it involved disjunction (which was not in their language), 
^ ■ is not irrelevant at all. 

^ ! 1 Introduction 

O ' Counterfactual reasoning arises in broad array of fields, from statistics to economics to law. Not sur- 

prisingly, there has been a great deal of work on giving semantics to counterfactuals. Perhaps the 
best-known approach is due to Lewis II1973I and Stalnaker [1968], and involves possible worlds. The 
idea is that a counterfactual of the form "if A were the case then B would be the case", typically written 
is true at a world w if B is true at all the worlds closest to w where A is true. Of course, making 
this precise requires having some notion of "closeness" among worlds. 

IVTore recently. Pearl 112000 1] proposed the use of causal models based on structural equations for 
reasoning about causality. In causal models, we can examine the effect of interventions, and answer 
questions of the form "if random variable X were set to x, what would the value of random variable Y 
be". This suggests that causal models can also provide semantics for (at least some) counterfactuals. 

The relationship between the semantics of counterfactuals in causal models and in counterfactual 
structures (i.e., possible-worlds structures where the semantics of counterfactuals is given in terms of 
closest worlds) was studied by Galles and Pearl 1 1998] . They argue that the relationship between the 
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two approaches depends in part on whether we consider recursive (i.e., acyclic) models (those without 
feedback — see Section[2]for details). They reach the following conclusion BPearl 20001 p. 242]Q 

In sum, for recursive models, the causal model framework does not add any restrictions to 
counterf actuals beyond those imposed by Lewis's framework; the very general concept of 
closest worlds is sufficient. Put another way, the assumption of recursiveness is so strong 
that it already embodies all other restrictions imposed by structural semantics. When we 
consider nonrecursive systems, however, we see that reversibility [a particular axiom intro- 
duced by Galles and Pearl] is not enforced by Lewis's framework. 

This conclusion seems to have been accepted by the community. For example, in the Wikipedia arti- 
cle on "Counterfactual conditional" (en.wikipedia.org/wiki/CounterfactuaLconditional; Sept., 2009), it 
says "[The definition of counterfactuals in causal models] has been shown to be compatible with the 
axioms of possible world semantics." 

In this paper I examine these claims and the proofs given for them more carefully, and try to settle 
once and for all the relationship between causal models and counterfactual structures. The first sentence 
in the statement above says "for recursive models, the causal model framework does not add any restric- 
tions to counterfactuals beyond those imposed by Lewis's framework". It is not clear (at least to me) 
exactly what this means. Recursive models ai^e a well-defined subclass of causal models. Galles and 
Pearl themselves show that there are additional axioms that hold in recursive models over and above 
those that hold in counterfactual structures. Indeed, they show that the reversibility axiom mentioned 
above is valid in recursive models and is not valid in possible-worlds models. They also show that all 
the axioms of causal reasoning in the possible-worlds framework that they view as relevant (specifi- 
cally, axioms that do not mention disjunction, since it is not in their language) hold in recursive causal 
models. Thus, the only conclusion that can be drawn from their argument is just the opposite to what 
they claim, namely, that the possible-worlds approach does not add any restrictions to causal reasoning 
beyond those imposed by causal models, since causal models satisfy all the axioms that counterfactual 
structures do, and perhaps moreJl 

Pearl [private communication, 2010] intended the clause "for recursive models" to apply to coun- 
terfactual structures as well as to structural models. However, the notion of a recursive counterfactual 
structure is not defined either by Galles and Pearl II1998I or Pearl II2000II . In fact, in general, the notion 
of recursive model as, defined, for example, in [P earl 20001 Definition 7.3.4], does not even make sense 
for counterfactual structures. I show that, if we focus on the language considered by Galles and Pearl 
and counterfactual structures appropriate for this language, there is a well-defined subclass of counter- 
factual strucures that can justifiably be viewed as recursive (counterfactual) structures. I then show that 
precisely the same formulas in this language are valid in recursive causal models and recursive counter- 
factual structures. Put another way, at least as far as Galles and Pearl's language goes, recursive causal 
models and recursive counterfactual structures are equally exprssive. Thus, by restricting to recursive 
counterfactual structures (as Pearl intended), the Galles-Pearl claim is, in a sense correct (although the 
claim does not follow from their argument — see below). However, it should be noted that Galles and 

'The discussion in Section 7.4.2 of IPearl 20001 is taken almost verbatim from IGalles and Pearl 19981 ; since the former 
is more widely available, that is what I cite in this paper 

^Although it is not relevant to the focus of this paper, I in fact also do not understand the second sentence in the Pearl quote 
above. As for the third sentence, while it is the case that reversibility does not hold in counterfactual structures in general, 
reversibility holds in recursive causal structures as well as more general causal structures, as Pearl and Galles themselves show. 
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Pearl's language cannot, for example, express statements with disjunctive antecedents, such as "if he 
had chosen a different running mate or had spent his campaign funding more wisely then he would have 
won the election". The claim applies only to their restricted language. 

Galles and Pearl try to prove their claims by considering axioms; my proof works at the level of 
structures. Specifically, I show that a recursive causal model satisfying a particular formula can be 
effectively converted to a recursive counterfactual structures satisfying this formula, and vice versa. It 
is actually important to work at the level of structures to prove this claim, rather than working at the 
level of axioms. I show that the argument that Galles and Pearl give for their claim is incorrect in some 
significant respects. The problem lies with their claim that axioms involving disjunctive antecendents 
are irrelevant. As I show, they are quite relevant; a proof of a formula not involving disjunction in the 
antecedent may need to use an axiom that does. This possibility is illustrated by considering a class of 
models slightly larger than recursive models. In a recursive model, given a context, there is a unique 
solution to every equation. In earlier work |Halpem 2000 1, 1 showed that there are non-recursive causal 
models where every equation has a unique solution. Galles and Pearl's argument applies without change 
to causal models where the equations have a unique solution. However, as I show here, these models are 
actually incomparable in expressive power to counterfactual structures. The reversibility axiom remains 
valid in causal models where the equations have a unique solution. But, as I show by example, there 
is a formula that does not involve disjunction that is valid in counterfactual structures but is not valid 
in causal models where the equations have a unique solution (and, a fortiori, also not valid in recursive 
causal models). Not surprisingly, proving that this formula is valid from the axioms requires the use of 
an axiom that involves disjunction. 

These results show that the quote from Wikipedia is not quite true. While it is true that recursive 
causal structures are, in a certain sense, compatible with the axioms of possible worlds semantics, the 
slightly more general class of causal structures with unique solutions to the equations is not. Thus, in 
general, the semantics of counterfactuals in causal structures cannot be understood in terms of closest 
worlds. 

The rest of this paper is organized as follows. In Section |2l I review the causal models and coun- 
terfactual structures. The main technical results are in Section |3l which also includes a discussion of 
the problems with the Galles-Pearl argument. I conclude in Section |4] with some discussion of these 
results. 



2 Causal models and possible- worlds models: a review 

In this section I review the relevant material on causal models and possible-worlds models. 
2.1 Causal models 



(The following discussion is taken, with minor modifications, from |Halpem 2000|.) Causal models 
describe the world in terms of random variables, some of which have a causal effect on others. It is 
conceptually useful to split the random variables into two sets, the exogenous variables, whose values 
are determined by factors outside the model, and the endogenous variables, whose values are ultimately 
determined by the exogenous variables. The values of the endogenous variables are characterized by a 
set of structural equations. 
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For example, if we are trying to determine whether a forest fire was caused by lightning or an 
arsonist, we can take the world to be described by four random variables: 

• FF for forest fire, where FF = 1 if there is a forest fire and FF = otherwise; 

• L for lightning, where L = 1 if lightning occurred and L = otherwise; 

• MD for match dropped (by arsonist), where MD = 1 if the arsonist dropped a ht match, and 

MD = otherwise; 

• E, which captures the external factors that determine whether the arsonist will drop a match, or 
whether lightning will strike. 

The variables FF, L, and MD are endogenous, while E is exogenous. If we want to model the fact 
that if the arsonist drops a match or lightning strikes then a fire starts, then we would have the equation 
FF = max(L, MD); that is, the value of the random variable FF is the maximum of the values of the 
random variables MD and L. This equation says, among other things, that if MD = and L = 1, then 
FF = 1. Alternatively, if we want to model the fact that a fire requires both a lightning strike and a 
dropped match (perhaps the wood is so wet that it needs two sources of fire to get going), then the only 
change in the model is that the equation for FF becomes FF = min(L, MD); the value of FF is the 
minimum of the values of MD and L. The only way that FF = 1 is if both L = 1 and MD = 1. 

Formally, a signature <S is a tuple {U, V, TZ), where U is a finite set of exogenous variables, V is a 
finite set of endogenous variables, and TZ associates with every variable X £ U L)V a. finite set TZ(X) 
of possible values for X (the range of possible values of X). A causal model is a pair T = (5, J"), 
where <S is a signature and F associates with each variable X G V a function denoted Fx such that 
Fx '■ ( X ze{uuV-{x})T^{Z)) — ^ 'R.{X). Fx characterizes the value of X given the values of all the other 
variables inUUV. Because Fx is a function, there is a unique value of X once all the other variables are 
set. Notice that we have such functions only for the endogenous variables. The exogenous variables are 
taken as given; it is their effect on the endogenous variables (and the effect of the endogenous variables 
on each other) that is modeled by the structural equations. 

Given a causal model T = (5, J"), a (possibly empty) vector X of variables in V, and a vector x of 
values for the variables in X, we can define a new causal model denoted T-^^^. Intuitively, this is the 

causal model that results when the variables in X are set to x. Formally, T^^^ = {S, J""^^^}), where 

j:X=x identical to T except that the equation for X is replaced by the equation X = x. The model 
r^^^ describes a possible counterfactual situation; that is, even though, under normal circumstances, 
setting the exogenous variables to u results in the variables X having value x' ^ x, this submodel 
describes what happens if they are set to x due to some "external action", the cause of which is not 
modeled explicitly. 

In general, given a context u, that is, a setting for the exogenous variables, there may not be a 
unique vector of values that simultaneously satisfies all the equations in T^^^; indeed, there may not 
be a solution at all. One special case where the equations in a causal model T are guaranteed to have 
a unique solution is when there is a total ordering -<t of the variables in V such that if X -<t Y, then 
Fx is independent of the value of Y; i.e., Fx{- . . ,y,. . .) = Fx{- . . ,y',. . .) for all y, y' G Tl^Y). In 
this case, T is said to be recursive or acyclic. Intuitively, if T is recursive, then there is no feedback; if 
X -<T Y, then the value of X may affect the value of Y, but the value of Y has no effect on the value 
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of X. It should be clear that if T is a recursive model, then, given a context u, there is always a unique 
solution to the equations in T-g^^, for all X and x, (We simply solve for the variables in the order given 
by ^T-) 



Following IHalpem 2000 1, I consider three successively more general classes of causal models for 



a given signature S (with a focus on the first two): 

1. Treci-S): the class of recursive causal models over signature S; 

2. 71iniq(»5): the class of causal models T over S where, for all X C V, x, the equations in T^^^ 
have a unique solution for all contexts u; 

3. T{S): the class of all causal models over S. 

I often omit the signature S when it is clear from context or irrelevant, but the reader should bear in 
mind its important role. 



Syntax and Semantics: In IHalpem 2000 1, a number of languages for reasoning about causality are 
considered. The choice of language is significant. As Galles and Pearl already point out, we cannot 
in any obvious way give a meaning in causal models to counterfactual implications where there is a 
disjunction on the left-hand side of the implication, that is formulas of the form {A V A') ^ B. Thus, 
our results effectively consider a language with no disjunction on the left-hand side of ^. As mentioned 
in the introduction, one of the results of this paper have the form "for every recursive causal model, 
there is a recursive causal structure that satisfies the same formulas", and vice versa. For this result to 
have any bite, we must choose a reasonably rich language. 



In IHalpem 2000 1, 1 considered a number of languages appropriate for reasoning about causality in 
causal models. I briefly review two of them here. The languages are parameterized by the signature S. A 
basic causal formula is one of the form [Yi = yi, . . . , = yu]'^, where is a Boolean combination of 
formulas of the form X = x, Yi, . . . , y^, X are variables in V, Yi, . . . , Yfc are distinct, and x € TZ{X). 
I typically abbreviate such a formula as [y = ip. The special case where A; = is abbreviated as (/?. 
A causal formula is a Boolean combination of basic causal formulas. Let C'^{S) consist of all causal 
formulas over the signature S. (Again, I often omit the signature S if it is clear from context or not 
relevant.) 

Roughly speaking, we can think of C^{S) as the language that results by starting with primitive 
propositions of the form X = x, where X is a random variable in V and x G TZ{X), and closing 
under modal operators of the form \Y = y\. The restriction to primitive propositions of this form is 
not a major one. Given a propositional language with primitive propositions Pi, ... , f„, we can define 
binary random variables Xi, . . . , X„ (i.e., variables whose range is {0, 1}) and identify Xi = 1 with 
"Pj is true". That is, as long as we can we define structural equations that characterize how a change in 
one primitive propositions affects the other primitive propositions, taking the primitive propositions to 
have the form X = x is not a major restriction. 

The formula \Y = y\X = x can be interpreted as "in all possible solutions to the structural equations 
obtained after setting Yi to yj, i = 1, . . . , A;, and the exogenous variables to u, random variable X has 
value x". This formula is true in a causal model T in a context u if in all solutions to the equations in 
T'y^j^ in context u, the random variable X has value x. Note that this formula is trivially true if there 
are no solutions to the structural equations. 
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A formula in C~^{S) is true or false in a causal model in T(5), given a context u. As usual, we 
write (T, u) |= if the causal formula 99 is true in causal model T given context n|l For a basic causal 
formula [Y = y\(p, we define (T, u) \= [Y = y\ip if (p holds in all solutions to the equations T^=y with 
the values of the variables in U set to u. Thus, for example, if has the form Xi = xi V X2 = X2, then 
(T, u) ^ 93 iff every vector of values for the endogenous variables that simultaneously satisfies all the 
equations in T has either Xi = xi or = X20 We define the truth value of arbitrary causal formulas, 
which are just Boolean combinations of basic causal formulas, in the obvious way: 

• T \= ifi A ip2 if T ^ ipi and T \= ip2 

• T^^ififT^ip. 

As usual, a formula ip is said to be valid with respect to a class T' of causal models if T \= p for all 
T G v. 



is the most general language that I consider in [Halpem 2000 1. To compare my results to those 
of GP, who use a more restricted language, I also consider some restrictions of £+. Specifically, let 
>Cuniq be the sublanguage of that consists of Boolean combinations of formulas of the form [Y = 
y\{X = x). Thus, the difference between >Cuniq and is that in £uniq> only X = x is allowed after 
[Y = y\, while in arbitrary Boolean combinations of formulas of the form X = x aie allowed. 



As the following lemma, proved in [Halpem 2000 1, shows, for reasoning about causality in Tuniq, the 
language C^niq is adequate, since it is equivalent in expressive power to 



Lemma 2.1 [ Halpem 2000| In Taniq cmd Tree, the language C'^ and /luniq ci^^ expressively equivalent; 
for every formula (p € C'^, we can effectively find a formula ip' € >Cuniq such that T^niq N V V^'- 

The equivalence described in Lemma 12.11 no longer holds when reasoning about causality in the 
more general class T of structures. However, since I focus in this paper on T^ec and 7^niq» in the rest of 
the paper I consider the language >Cuniq; this suffices to make my points. 



Axiomatizations I briefly recall some standard definitions from logic. An axiom system AX consists 
of a collection of axioms and inference rules. An axiom is a formula (in some predetermined language 
C), and an inference rule has the form "from pi, . . . ,pk infer ipT where 991, . . . , yj^, are formulas in 
C. A proof in AX consists of a sequence of formulas in C, each of which is either an axiom in AX or 

''in (Halpem 2000| , following IGalles and Pearl 19981 . the context was included in the formula. The definition of (T, u) \= 
X = X given here is intended to be equivalent to that ofT \= X{u) — x. The advantage of having u on the left-hand side 
of 1= (which is also the formalism used in (Halpem and Pearl 20051) is that it enforces the intuition that the context consists 
of background information that is typically suppressed. It does lead to loss in expressive power, since it is not possible to 
consider formulas of the form Xi{ui) = xi A X2{u2) = X2, where ui 7^ U2. But this tums out to be an advantage — see the 
next footnote. 

''in (Halpem 2000[ , truth was defined for formulas of the form [Y = y\{X — x), and extended in "the obvious way" to 
Boolean combinations; that is V 1^2 was taken to be true if either tpi was true or ip2 was true. This is equivalent to the 
approach described above as long as all equations have a unique solution, that is, in Tiiniq. However, the two approaches are 
not equivalent if equations can have more than one solution. The approach suggested here is what is required to make the 
axioms given in ( Halpem 2000[ sound in the general case. We do not want to say that Xi = xi V X2 = X2 is tme if Xi — xi 
is true in all solutions to the equations or X2 = X2 is tme in all solutions to the equations; rather, we want either Xi = xi or 
X2 = X2 to be tme in all solutions. With the approach given above, it is not clear how to give semantics to formulas such as 
Xi (mi ) = Xi V X2 (u2 ) = X2 'iiu\ ^ U2, which is perhaps another good reason for not allowing such formulas. 
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follows by an application of an inference rule. A proof is said to be a proof of the formula ip if the last 
formula in the proof is ip. We say if is provable in AX, and write AX h ip, if there is a proof of ip in AX; 
similarly, we say that ip is consistent with AX if -k/? is not provable in AX. 

An axiom system AX is said to be sound for a language C with respect to a class T' of causal models 
if every formula in C provable in AX is valid with respect to T'. AX is complete for C with respect to 
T' if every formula in C that is valid with respect to T' is provable in AX. 



Consider the following axioms, taken from |Halpem 2000 1, modified slightly for the language used 
here: 

CO. All instances of propositional tautolog ies in the language >Cuniq- 

CI. [Y = y\{X = x)^^[Y = y\{X = x'),ifx,x' en{X),x^x'. (Equality) 

C2. V^.g7e(x)[? = = x). (Definiteness) 

C3. {[X = x]{W = w)h {[X = x\{Y = y))^[X = x]W = w]{Y = y)).^ (Composition) 

C4. [X = x;W = w]{X = x). (Effectiveness) 

C5. {[X = x;W = w]{Y = y) h[X = x;Y = y]{W = w)) 

^[X = x\{Y = y), iiY i^W^ (Reversibihty) 

The key axioms C3-C5 were introduced (and named) by Galles and Pearl 1119981 . Perhaps most relevant 
to this paper is the reversibility axiom, C5. It says that if setting X to x and W iow results in Y having 
value y and setting X to x and y to y results in W having value w, then Y must already have value y 
when we set X to x (and W must already have value w). 

Let AXuniq(5) consist of C0-C5 and the rule of inference modus ponens (from (p and ip ^ ijj infer 

V')- 



Theorem 2.2 f| Halpem 2000 1) AXuniq(5) is a sound and complete axiomatization for £umq('5) with 
respect to 7^niq(»5). 

Using Lemma l2?n it is possible to get a complete axiomatization for with respect to Tu_niq{S), 
simply by adding axioms for converting a formula in to an equivalent formula in £uniq- A complete 



axiomatization for -Cuniq with respect to %cc{S) is also given in |Halpem 2000 1; it requires adding a 
somewhat complicated axiom called C6 to AXuniq that captures acycUcity. The details are not relevant 
for our purposes, so I do not discuss C6 further. 

Finally, a complete axiomatization for C'^ with respect to T{S), the class of all causal models, is 
given. The axioms are similar in spirit to those in AXuniq- In particular, there is the following analogue 
to reversibility (where {X = is an abbreviation for -^[X = x\-^ip): 

{{X = x;Y = y){W = w h Z = z) h {X = x;W = w){Y = y h Z = z)) 
^ {X = x){W = w ^Y = y ^ Z = z)), where Z = V -{X\J{W, Y]). 



'Galles and Pearl use a stronger version of C3: {[X = x\{W = w) ^ (\X = x](Y = y)) <^ [X = x;W = w](Y = y)). 
The stronger version follows from the weaker version in the presence of the other axioms. (This is actually shown in the proof 
of ProDosition l3.3l ) 

*The assumption that Y ^ W was not made by Galles and Pearl 119981 nor by Halpem 120001 . but it is necessary. For 
example, ifW = Y, then one instance of C5 would be {[X = x;Y = y]{Y = y) A [X = x;Y = y](Y = y)) => [X = 
x\{Y = y). The antecedent is true by C4, while the conclusion is not true in general. 
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2.2 Possible-worlds models for counterfactuals 



There have been a number of semantics for counterfactuals. I focus here on one due to Lewis II1973II . 
Let be a finite set of primitive propositions. A counterfactual structure M is a tuple (il,/?, vr), 
where 17 is a finite set of possible worlds^ vr is an interpretation that maps each possible world to 
a truth assignment over and i? is a ternary relation over 17. Intuitively, {w,u,v) S i? if u is as 
close/preferred/plausible as v when the real world is w. Let u :<^ai v be an abbreviation for {w, u, v) € R, 
and define = {u : u :<w G 7^ for some v G Q,}; thus, the worlds in are those that are at least 
as plausible as some world in Q, according to Define v v' if v v' and v' v. We require 
that w G that be reflexive and transitive on Q.^,, and that w -<w u for all u ^ w. Thus, -<u, puts 
an ordering on worlds that can be viewed as characterizing "closeness to w", and w is the closest world 
to itself. 

Let C'~^{^) be the language formed by starting with <I> and closing off under A, -■, and ^ (where 
>z denotes counterfactual implication). The language allows arbitrary nesting of counterfactual 
implications. By way of contrast, the language £+ and its sublanguages have only one level of nesting, 
if we think of [X = x](p as {X = x) >: (p. Let be the sublanguage of C*-^ consisting of all formulas 
with no nested occurrence of >z (including formulas with no occurrence of >z at all). 

We can give semantics to formulas in C'^ (and hence Cf) in a counterfactual structures M = 
{Q, R, vr) as follows. The first few clauses are standard: 

• (M, w) \= p, when p G if it{w){p) = true. 

• (M, w) \= (fAil^if (M, w) ^ip and (M, w) \= ip. 

• (M, w) \= -199 if it is not the case that (M, w) \= if. 

To give semantics to ip t/j, assume inductively that we have already given semantics to ip at all worlds 
in M. Define closest m{w, ip) = {v ^ Qu> '• {M, v) \= if and there is no world v' such that (Af , v') \= ip 
and v' v}. Thus, closest m{w, ip) is the set of worlds closest to w where if is true. (Notice that if 
there are no worlds where ip holds, then closest m{w, p) = 0.) 

• (M, w) \= ip^i^if {M, v) \= iIj for all v G closestM{w, p)- 



Axioms: There are a number of well-known sound and complete axiomatizations for counterfac- 
tual logic (see, e.g., |Burgess 19811 IBell 19891 IChellas 19801 IGrahne 19911 IKatsuno and Satoh 19911 
ILewis 197n [Lewis 19731 ILewis 19741 ). Here is one, based on Burgess's axiomatization, similar in 



spirit to the well-known KLM properties |Kraus, Lehmann, and Magidor 1990 1. 



AO. All instances of propositional tautologies in the language C 
Al. p^p. 

A2. ((v? >= V'l) A ^ V2)) ^{p^ (V'l A ^2))- 



c 



^Giving semantics to counterfactual formulas in structures with infinitely many worlds adds an extra level of complexity. 
As shown in (Friedman and Halpem 1994) , if a formula in the language I am about to introduce is satisfiable at all, it is 
satisfiable in a structure with finitely many worlds, so as far as validity goes, there is no loss of generality in restricting to finite 
structures. 
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A3. (((^1 h ^2) A {ifi >z V')) ^ ((V^i A ip2) >z V')- 
A4. {{ipi >i ^) A (v?2 >= V')) ^ ((<^i V <^2) ^ V')- 
A5. ^{true >z false). 
A6. ^ (V' <S=> (95 ^ V^))- 

There are three rules of inference: modus ponens, and the following two rules: 

RAl. From ip <^ ip' infer {p h tp) ^ {p' h ip)- 
RA2. From tp ^ tp' infer {ip h ip) ^ ip')- 

A1-A4 and RA1-RA2 correspond to the KLM postulates REF (for reflexivity), AND, CM (cautious 
monotonicity), OR, LLE (left logical equivalence), and RW (right weakening), respectively. A5 captures 
the requirement that is nonempty. Finally, A6 is the axiom that makes >z a counterfactual operator, 
and not just a "normality" or "typicality" operator (so that "normally birds have wings" becomes bird >z 
wing). Suppose that it is the case that if ip were true, then ip would be true. Then if tp is actually true, 
we would expect ip to be true. Moreover, if p and ^p are both true, then it seems reasonable to assert that 
if p were true, then ip would be true. On the other hand, it is not in general the case that if ip and ^p are 
both true, then 93's are normally or typically ^p's. 

In his semantics, Lewis allows there to be more than one world closest to a world w where ip is true 
(except in the special case that ip is true at lu; in this case, w is the unique world closest to w satisfying 
p>). By way of contrast, Stalnaker |1968] essentially assumes that for each world w and formula ip, 
there is a unique world closest to w satisfying p>. The standard approach to getting uniqueness is to 
require that be a strict total order (whose least element is w); that is, for all worlds w' ^ w" , either 
w' w" or w" w'- This assumption is captured by the following axiom: 

A7. {ip h (tpi V tP2)) ((v? h V'l) V ^ V2))- 

Let AX be the axiom system consisting of axioms A0-A6 and rules of inference RAl, RA2, and 
modus ponens; let AX' be AX together with the axiom A7. Let M{^) be the collection of all counter- 
factual structures over the primitive propositions in $ (that is, structures where vr interprets formulas in 
^); let A4^{^) be the subset of M.{^) consisting of all counterfactual structures where is a total 
strict order. As usual, I omit the $ when it is clear from context or irrelevant. 



Theorem 2.3 |Burgess 1981 1 AX (resp., AX') is a sound and complete axiomatization for the language 
-^uniq with respect to M (resp., ^A^). 



3 Relating causal models to counterfactual structures 

As the title suggests, in this section I take a closer look at the relationship between causal models 
and counterfactual structures. Given a signature S = {U,V,TZ), consider the set $5 of primitive 
propositions of the form X = x foi X ^ V and x € lZ{x). We restrict to counterfactual structures 
M = (O, R, vr) for this set of primitive propositions, where tt is such that, for each world w; G and 
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variable X £ V, exactly one of the formulas X = x is true. Call such structures acceptable. In an ac- 
ceptable counterfactual structure, a world can be associated with an assignment of values to the random 
variables. An acceptable structure (17, R, vr) is full if, for each assignment v of values to variables and 
all w £ Q., there is a world in 0.^) where v is the assignment. (I discuss the consequences of fullness 
shortly.) Let M.a{^s) consist of all acceptable counterfactual structures over S; let M.f{(^s) consist 
of all full acceptable counterfactual structures over S; let M^i^s) = M.a{^s) H M^{<^s)\ and let 

As before, I identify \Y = y\{X = x) (£ >Cuniq(5) with the formula Y" = y ^ (X = x) G ^{{^s)- 
I abuse notation and use jCuniqC'?) to denote the sublanguage of >Cf (<I>5) that arises via this identification. 
The following is easy to show. 

Proposition 3.1 C3 and C4 are valid in Ma{^s): C2 is valid in M^{^s): CI is valid in Mf{(ps)- 
Notice that, in Ma{^s) (and hence all of its subsets), the following two formula schemes are valid: 

VI. y^enx)X = X 

V2. If X / x', then X = x ^ X ^ x' . 
In 7Wj(<I>5), the following axiom is vahd: 

V3. ^[X = x]false. 

V3 is valid in a structure where, for all worlds w, there is a closest world to w such that X = x, that is, 
closest m{w, X = x) ^ %, which is exactly what fullness ensuresj^ 

With this background, I can give an "axiomatic" proof of Proposition 13.11 

• CI follows easily from AO, A2, A5, and V2. 

• C3 is a special case of A3. 

• C4 follows easily from Al and RA2. 

• C2 is not provable in AX (and is not valid in 7V4a($5)), but it is provable in AX' together with the 
axiom VI. Indeed, it follows easily from Al, A7, RA2, and VI (since =^ '^x&'R(x)^ = x is 
valid in M.{(^s))- The fact that C2 requires A7 is not surprising. C2 is essentially expressing the 
uniqueness of solutions, which is being captured by the assumption that there is a unique closest 
world where the antecedent is true, an assumption captured by A7. 

Reversibility (C5) is conspicuously absent from this list. Indeed, as the following example shows, C5 is 
not sound in counterfactual structures. 

Example 3.2 Suppose that V = {Xi, X2, X3}, and TZ{Xi) = n{X2) = 11{X^) = {0, 1}. Consider a 
structure (17, ii, vr) € A4j where there is exactly one world in Q. corresponding to each of the 8 possible 
assignments of values to the variables. Thus, we can identify a world w inO. with a triple (5i, 62, ^3). 

'^Galles and Pearl did not discuss the axioms V1-V3, but it is clear that they are implicitly assuming that they hold. 
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where, in w, Xi = bi. All that matters about R is that (0, 0, 0) ^(o,o,o) (l^ Oi 0) ^(o,o,o) (1) !)> with 
the other 5 worlds being further from (0, 0, 0) that (1, 1, 1). Then it is immediate that 

(M, (0,0,0)) ^ [Xi = l;X2 = 1](X3 = 1) A [Xi = 1;X3 = 1]{X2 = 1) A [Xi = 1]{X2 = 0). 

Thus, C5 is violated. | 

For the remainder of this section, fix 5 = {U,V,7i). Say that a counterfactual structure M = 
{Q.,R,'k) in M.'^{(^s) is recursive if, for, for each world w ^ there is a total ordering -<j\/ of the 
variables in V such that ifW -<m Y , then for all X C V — {Y, W}, in the closest world to w where 
X = X and Y = y, the value of W is the same as in the closest world to w where X = x. Intuitively, 
setting Y to y has no further effect on the value of W once X is set to x. Let A^rec consist of the 
recursive structures in M^. It is easy to see that the structure M considered in Example 13.21 is not in 
A^rcc- For if it were, then we would have either X2 -<m or X2. If X-^ -<m X2, then the 

value of X3 would have to be the same in the closest world to (0, 0, 0) where Xi = I and = 1 as in 
the closest world to (0, 0, 0) where Xi = 1. But it is not. The same problem occurs if X2 -<m ^i- 

Proposition 3.3 C5 is valid in A^rec- 

Proof: Suppose that M = vr) € Mrec, w ^ Q., and the antecedent of C5 holds at {M,w). If 

Y W, then it is immediate from the fact that {M,w) \= [X = x;Y = y]{W = w) that we 
also have {M,w) \= [X = x]{W = w). Now suppose that W Y- Then from {M,w) \= [X = 
x'W = w]{Y = y) it follows that {M,w) \= [X = x\{Y = y). Suppose, by way of contradiction, 
that {M^w) \= [X = x\{W = w') for some w' 7^ w. By C3 (which, by Proposition 13. 1[ is sound 
in M), it follows that {M,w) [X = x;Y = y]{W = w'). This, combined with the fact that 
{M,w) \= [X = x]Y = y\{W = w), contradicts CI (which, by Proposition 13.11 is also sound in 
M). (Note that this ai^gument actually shows that the stronger version of C3 used by Galles and Pearl, 
discussed in Footnote 4, follows from the weaker version used here.) | 

Not surprisingly, the proof of Proposition [33]is essentially identical to the argument given by Galles 
and Pearl that reversibility holds in recursive structures. Indeed, it is not hard to show that the axiom 
C6 that characterizes recursive structures is valid in A(rec. and C5 follows from the other axioms in the 
presence of C6. 

Even more can be shown. In a precise sense, every causal model in %cc{S) can be identified 
with a counterfactual structure in Mt-cc{S) where the same formulas are true. It follows that recursive 
counterfactual structures are at least as general as recursive causal models. The converse is also true. 1 
now make these claims precise. 

Given a causal model T = {S, T) e %cc, we construct a model Mt = (J^, R, tt) as follows. Let Q 
consist of all the assignments of values to the variables mUVJV. The interpretation vr is defined in the 
obvious way: X = x is true in w if w assigns X value x. For each context u, let be the assignment 
to the variables in V that is forced by the equations. (Since T € Tree, is uniquely determined.) More 
generally, for each assignment X = x, let be the assignment to the variables in V determined 

by the equations T^^^ in the context u. Let = {u, v^) and let w^-^^^ = (n, v^ji^^). Finally, let 

R be such that the closest world to where X = xis w and for all assignments u and v to the 
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exogenous and endogenous variables, respectively, consists of all worlds (n, v') such that v' is an 
arbitrary assignment to the endogenous variables. This does not uniquely determine R. Indeed, it places 
no constraints on if w is not of the form and does not completely determine R even if w does 
have the form w^. It is easy to define a relation R that satisfies these constraints such that (1) is a 
strict total order on 0^^,^ and (2) for w not of the form w^, is a strict total order on Q.^, that satisfies 
the recursiveness constraint. 

Theorem 3.4 Mt G Mrec and, for all formulas Lp e CumqiS), we have (T, u) \= ip iff^Mx, w^) \= ip. 

Proof: It is easy to see that we can take ^j\/j,=^t> so Mt € A^rec- Using the definition of R, it is easy 
to show by induction on the structure of formulas that (T, u) \= ip iff {Mt^wh) \= Lp.\ leave the details 
to the reader. | 

Theorem 13.41 shows that we can embed %cc in A^roc- I next give an embedding of A^roc in %cc- 
Now the causal model in Tvcc depends both on the counterfactual structure and a world in that structure. 
(I discuss why this is so after Theorem 13.51 ) Suppose that M = vr) G Alrec('5) and w G 

Suppose that V = {^i, • • • , Xn} and, without loss of generality, that Xi Xj iff i < j. Consider 
the causal model Tm,w = (5, T), where Fx is defined by induction on the ^Af-ordering. That is, 
we first define Fx^, since Xi is the ^-minimal variable, then define Fx2, and so on. Suppose that 
(M, tu) \= Xi = xi- Then define Fx^ so that, for all contexts u and assignments v to the variables 
in V — {^i}, Fxi{u,v) = xi. Suppose that we have defined Fx^ for i < k. Define -Fx^+i so that, 
for all contexts u and all assignments v to the variables in V — {Xj^^i}, we have i^x^+i {u, v) = x iff 
{M,w) \^[Xi = vi-...,Xk = Vk]{Xk+i = x). 

Theorem 3.5 Tm,w G %ec and, for all formulas ip € C^mqiS) and all contexts u, we have (M, w) \= ip 
iff{TM,w,u) \= ip. 

Proof: It is easy to see that Tj\/ „, G Tree, the definition of T guarantees that „ =^m- The definition 
of Fx is independent of the context; it easily follows that for all contexts u and u' , we have (T, u) \= ip 
iff (T, n!) \= ip. Again, an easy induction on the structure of (p shows that, for all contexts u, (M, w) \= 
ip iff (T, u)^ip.\ 

It is easy to modify the construction slightly so that each context u corresponds to a different world 
it) G fi. Thus, if the number of contexts is at least then we can get a closer analogue to Theorem 13. 4[ 
where we can associate with each world w G a context and show that (M, vS)\= ip iff (T, u^) |= 
ip. However, Theorem 13 .5 1 suffices for the following corollary. 

Corollary 3.6 The same formulas in >Cuniq(*^*5) are valid in both %cc{S) and Aircc{S). 

Proof: If <p is not valid in Tvcc, then there is some causal model T G Tree and context u such that 
{T,u) \= -!(/?. By Theorem 13.41 {Mt,w^) \= -^ip, so Lp is not valid in Alrec- For the converse, if (p 
is not valid in M-rcc, then there is some counterfactual structure M G M.rcc and world w such that 
(M, w) \= -!(/?. By Theorem [331 {Tm^w, u) \= -199. Thus, the same formulas are satisfiable in both Tec 
and M-rec, and hence the same formulas are valid. | 
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What happens if we consider T^niq rather than T^ec? Now causal models and counterfactual struc- 
tures are incomparable. Consider the formula 

V=de{ [Xi = 1](X2 = 1 AX3 = 0)A 
[X2 = 1](X3 = 1 AXi =0)A 
[X3 = l](Xi = 1 AX2 = 0). 

Theorem 3.7 -up is valid in %ec and M. / (and hence M'f), but ip is satisfiable in T^miq- 

Proof: 1 first show that is valid in Tree- Consider a causal model T G Tree, and suppose, by way of 
contradiction, that (M, w) \= tp. Let <t be the total ordering on the variables in V in T. One of Xi, 
X2, and X3 must be minimal with respect to -<t- Suppose it is Xi and that (T, u) \= Xi = i. Then 
we must have {T,u) ^ [X2 = l]{Xi = i) and (T, n) [= [X3 = = i). But since (T, n) [= ip, 

it follows that {T,u) \= [X2 = l]{Xi = 0) and (T, n) ^ [X3 = l]{Xi = 1). Thus, Xi cannot be 
minimal with respect to -<t- An analogous argument shows that X2 and X3 also cannot be minimal 
with respect to -<t- Thus, we have a contradiction. 

I next show that -up is vahd in M.f. Suppose by way of contradiction that M e M.f and (M, w) \= 

ip. Consider a world w' closest to w that satisfies Xi = 1 V X2 = 1 V X3 = 1. (Since M G A^j, 
there is guaranteed to be such a world.) Suppose that (M, w') \= Xi = I. Note that w' must be one 
of the worlds closest to w that satisfies Xi = 1. Since (M, w) \= [Xi = 1]{X2 = 1), we must have 
{M,w') \= X2 = 1. Thus, w' must also be one of the worlds closest to w that satisfies X2 = 1. 
Since {M,w) \= [X2 = l]{Xs = 1), we must have that {M,w') \= X^ = I. On the other hand, 
since {M,w) \= [Xi = l]{Xs = 0), we must have have that {M,w') \= X^ = 0. This gives a 
contradiction. A similar contradiction arises if (M, w') ^ X2 = 1 or if (M, w') \= X3 = 1. Since 
(M, w') 1= Xi = 1 V X2 = 1 V X3 = 1 by construction, this gives a contradiction to the assumption 
that (M, w) \= (p. 

Finally, 1 must show that there is a causal model M G T^niq and a context u such that (M, u) \= tp. 
Let V = {Xi,X2,X3} and let W = {U}. Take 7^(Xl) = 7^(X2) = 7e(X3) = {0, 1} and 7^(C/) = 
{0}. In defining T, I write Fi instead of Fx^, and omit the U argument (since it is always 0). Thus, 
Fi (0, 0) = means that when ^2 = ^3 = 0, then Xi = 0. Define 

• Fi(0,0) =0; Fi(0,l) = 1; Fi(l,0) =0; = 0; 

. F2(0, 0) = 0; F2(0, 1) = 0; F2(l, 0) = 1; ^2(1, 1) = 0; 
. Fs{0, 0) = 0; F3(0, 1) = 1; ^3(1, 0) = 0; ^3(1, 1) = 0. 

I must now verify that M = (<S, G Tnniq, and that (M, 0) \= tp. This is straightforward, although 
tedious. First observe that (0, 0, 0) is the only solution to all the equations in the basic causal model. It 
is easy to see that (0, 0, 0) is a solution. To see that it is the only solution, observe that (0, 0, 1) cannot 
be a solution because .^3(0, 0) = 0; similarly, (0, 1, 0) and (1, 0, 0) cannot be solutions; (0, 1, 1) cannot 
be a solution because -^2(0, 1) = 0; (1, 1, 0) caimot be a solution because -Fi(l, 0) = 0; (1, 0, 1) cannot 
be a solution because ^3(1, 0) = 0; finally, (1, 1, 1) caimot be a solution because Fi{l, 1) = 0. 

It is clear that there must be a unique solution if we set two of the three variables (forced by the 
equation for the third variable); for example, if Xi = 1 and X2 = 0, then the unique solution is 
(1, 0, 0). If Xi, X2, or X3 is set to 0, then one solution is (0, 0, 0). The solution is unique for the same 
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reasons that (0, 0, 0) was the unique solution to the original collection of equations. Finally, if Xi = 1, 
then (1, 1,0) is a solution; if X2 = 1, then (0, 1, 1) is a solution; and if X3 = 1, then (1,0, 1) is a 
solution. We must show that there are no other solutions in all three cases. If Xi = 1, (1,0,0) is not 
a solution since ^2(1, 0) = 1; (1, 0, 1) is not a solution because ^3(1, 0) = 0; and (1, 1, 1) is not a 
solution because ^3(1, 1) = 0. If X2 = 1, (0, 1, 0) is not a solution because ^3(0, 1) = 1; (1, 1,0) is 
not a solution because Fi(l, 0) = 0; (1, 1, 1) is not a solution because Fi(l, 1) = 0. Finally, if X3 = 1, 
(0,0, 1) is not a solution because -Fi(0, 1) = 1; (0, 1, 1) is not a solution because ^2(0, 1) = 0; and 
(1, 1, 1) is not a solution because -Fi(l, 1) = 0. Thus, M E 7|jniq- 
It is now straightforward to check that (M, u) \= ip.M 

In Section im I argued that the choice of language was significant. All the results of this section are 
stated for the language >Cuniq- In light of Lemma |2. 1[ Theorems 13 .41 and 13.51 apply without change to the 
language Moreover, since Tlmiq ^ T and >Cuniq ^ the formula -193 from Theorem l3.7l (which 
is in Cnniq, and hence also in £+) continues to be unsatisfiable in T, while being valid in counterfactual 
structures. While C5 is not valid in T, the generalization of C5 mentioned at the end of Section IZT] is 



valid in T |Halpem 2000 1, and is not valid in counterfactual structures. So the full class T of causal 
models is incomparable in expressive power to counterfactual structures with respect the language 
(which is the language arguably most appropriate for T). 



4 Discussion 

I have shown that the expressive power of causal models as models for counterfactuals is incomparable 
to that of the Lewis-Stalnaker "closest-world" possible-worlds semantics for counterfactuals; thus, the 
definition of counterfactuals in causal models in not, in general, compatible with the axioms of possible 
world semantics, although it is if we restrict to recursive causal models. 

Specifically, causal models where the equations are recursive can be viewed as defining a strict 
subclass of the standard possible-worlds semantics. More precisely, a set of structural equation define a 
world w (characterized by the unique solution to the equations) and can be implicitly viewed as defining 
a ordering relation on worlds such that, for every formula X = x, the solution to the equations when X 
is set to X determines a world w-^^^ that is the world closest to w according to the ordering such that 
X = X. Somewhat surprisingly, this is not the case if we go to the larger class of causal models defined 
by equations that are not recursive, but have a unique solution for all settings X = x. Of course, it is 
still the case that there is a world Wx^g determined by the equations when X is set to x. However, there 
is, in general, no ordering on worlds such that Wji^^ is the closest world to w according to the ordering. 
A closeness ordering on worlds places some restrictions (e.g., those characterized by the formula -199 in 
Theorem 13.71 ) that do not hold in all causal models in Tlmiq- 

So where does this leave us? It is still the case that, in causal models, a formula such as (/? ^ ^ is 
true at a world w HiIj i?, true at some appropriate world w' satisfying y?. However, w' cannot be viewed 
as the "closest" world to w satisfying y?. This leaves open the question of whether there are other ways 
of defining "appropriateness" other than "closeness". I do not have strong intuitions here, but it is a 
question that is perhaps worth pursuing. 

Turning to a more technical point, it is worth trying to understand in more detail exactly what 
goes wrong in the Galles-Pearl argument. When making the comparison, Galles and Pearl did not 
use the axiom system AX. Rather, they used one of Lewis' axiomatization of conditional logic from 
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IILewis 1 97311 ■ Nevertheless, the problem with their argument can be understood in terms of AX. Galles 
and Pearl show that all but one of the axioms in the system they consider follows from C3, C4, and C5. 
(They actually also implicitly use CI and C2, but this is a minor point.) The remaining axiom is 

[((<Pi V 992) >= ^) ^ (((^1 >= V-) V (932 >= i'))]. 

For this axiom, they say "Because actions in causal models are restricted to conjunctions of literals that 
is, in the language of this paper, because in a formula in £uniq of the form 99 ^ V» 9^ is a conjunction 
of formulas of the form X = x], this axiom is irrelevant." They thus ignore the axiom. Unfortunately, 
this argument is flawed. If it were true, then it would be the case that the class of causal models in TLniq 
would be less general than counterfactual structures since causal models would satisfy more axioms — all 
the relevant axioms satisfied by counterfactual structures, and, in addition, C5 (reversibility). However, 
as we have seen, the formula ip of Theorem 13.71 is satisfiable in Tlmiq, while -k/? is valid in My. The 
formula -195 is equivalent to 

{[Xi = 1](X2 = 1 A = 0) A [X2 = 1]{X; = 1 A Xi = 0) 
^-[X3 = l](Xi = 1AX2 = 0). 

Thus, is a formula whose antecedent and conclusion are both formulas in >Cumq- The argument 
for the validity of -k/? given in the proof of Theorem 13.71 is purely semantic, but, as As I show in the 
appendix, -199 can also be derived from AX together with V2 and V3. The derivation uses A4 in a 
crucial way. Note that A4 has disjunctions on the left-hand side of ^ and thus cannot be expressed in 
£uniq- But it is not irrelevant! Indeed, an easy argument (given in the appendix) shows that -k/? cannot 
be derived using AX — {A4} together with V2 and V3 if we restrict to formulas in £uniq- Thus, we can 
start with assumptions in the language >Cuniq> end up with a conclusion in £uniq> but have a derivation 
that, along the way, uses A4 and has steps that involve formulas with disjunctions on the left-hand side 
of ^. The Galles and Pearl argument ignores this possibility. 

Acknowledgments: Thanks to Franz Huber and Judea Pearl for useful comments. 

A A derivation of -k^ 

In this appendix, I show that the formula 

([Xi = 1](X2 = 1 A X3 = 0) A [X2 = 1](X3 = 1 A Xi = 0) ^ -[X3 = l](Xi = 1 A X2 = 0) 

can be derived from AX together with V2 and V3. To simplify notation, I write h ip' if the formula Lp' 
can be derived from AX, and l-+ p' if p' can be derived from AX -1- {V2, V3}. 

I first need a technical lemma. 
Lemma A.l h {pi ^ 932 A 932 ^ 9^3) =^ {'fi V P2) h ^3- 

Proof: By Al, h 992 ^ ^2, and by A4, 921 ^ 992 A 992 ^ 922 (<^i V 992) ^ V'- Thus, 

I- (vi ^ ^2) =^ {^1 V P2) h p^2- (1) 
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Next observe that it easily follows from Al and A4 that (a) h {ipi A -^^2) ^ {'^2 ^ V^s) (since 
{ifi A -■(/?2) ^ {^2 =^ 923) is a tautology) and (b) H (922 ^ V's) ^ 9^2 ^ ((/?2 ^ 953) (since 
ip3 =^ {ip2 =^ (fs) is a tautology). By A4, RAl, and the observation that {{ipi A -'V'2) V (P2) <^ {^1 V (/?2) 
is a tautology, we can conclude that 

^ (9^2 ^ ^3) {{vi V 992) h {^2 ^3))- (2) 
Let ip be the formula {tpi ^ 9?2 A (^2 ^ 9^3)- From ([T]! and ©, it follows that 
h V ^ {{(fl V 992) ^ 9^2) A {ifi V (/?2) ^ (932 ^ ¥'3))- 
Applying A2 and RAl, it follows that 

hip ^ {{ipi V 932) >Z V3, 

as desired. | 

Now the proof is easy. By Lemma [ATT] it follows that 

h (Xi = 1 ^ = 1) A (X2 = 1 >r X3 = 1) ^ 
(Xi = 1 VX2 = 1) >r X3 = 1. 

Applying the lemma again, we get that 

h (Xi = 1 V X2 = 1) >i X3 = 1 A (X3 = 1 )r Xi = 0) ^ 
(Xi = 1 V X2 = 1 V X3 = 1) ^ Xi = 0. 

Let tp' be an abbreviation for Xi = 1 V X2 = 1 V X3 = 1. Thus, h 93 ^ {ip' ^ Xi = 0). An analogous 
argument shows that h 99 =^ ^' ^ X2 = 0) and h 9? {tp' >z X3 = 0). By A2, we have that 

H 93 ^ >r (Xi = AX2 = AX3 = 0). (3) 

By V2, we have that h+ V' ^ (-'^1 / V X2 / V X3 / 0). Thus, by Al and RA2, we have that 

h+ 9P ^ (V-' ^ (Xi / AX2 / AX3 / 0). (4) 

By A2, RAl, dill, and ©, we have that 

h+ 99 ^ {iP' h false). 

By V3, it follows that l-+ -199, as desired. | 

Note that all the axioms in AX other than A4 can be expressed using the language £uniq (pro- 
vided that the formulas 99 and tp mentioned in these axioms are taken to be conjunctions of primitive 
propositions or just single primitive propositions, as appropriate). It is easy to check that all these ax- 
ioms are valid in TJmiq, as are V2 and V3. Since 99 is satisfiable in T^niq, ~"/9 cannot be proved from 
(AX - {A4}) U {V2,V3}. 
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